Multimedia apparatus and multimedia system

ABSTRACT

A multimedia apparatus includes a wireless transmission device, an audio receiving device, a processing device and a projection device. The audio receiving device is for receiving a first voice data. The processing device is coupled with the audio receiving device and the wireless transmission device. The processing device is for outputting the first voice data via the wireless transmission device. The projection device is coupled with the processing device. When the processing device receives a first image data corresponding to the first voice data via the wireless transmission device, the processing device operates the projection device to project a first projection picture according to the first image data. A multimedia system including the multimedia apparatus is also disclosed.

RELATED APPLICATIONS

This application claims priority to Taiwan Application Serial Number106117516, filed May 26, 2017, which is herein incorporated byreference.

BACKGROUND Field of Invention

The invention relates to a media apparatus and, in particular, to amultimedia apparatus including a projection device and a multimediasystem.

Related Art

Along with the development of technology, various multimedia electronicproducts, such as smartphones or smartwatches, have been designed ordeveloped. Due to the evolution of technology, modern multimediaapparatuses tend to be developed toward the direction of providingmultiple functions. Particularly, more and more multimedia apparatusesprovide users with voice control via voice recognition functionalities.However, the simple multimedia operations provided by traditionalmultimedia apparatuses cannot satisfy the needs of the user or newmultimedia applications. Therefore, how to design a multimedia apparatusto provide convenient and diversified multimedia operations and gooduser experiences is an important issue.

SUMMARY

The invention provides a multimedia apparatus and a multimedia systemthat can receive the voice data provided by a user, and can projectcorresponding image data based on the recognition result of the voicedata provided by the user.

The multimedia apparatus of the invention includes a wirelesstransmission device, an audio receiving device, a processing device anda projection device. The audio receiving device is for receiving a firstvoice data. The processing device is coupled with the audio receivingdevice and the wireless transmission device. The processing device isfor outputting the first voice data via the wireless transmissiondevice. The projection device is coupled with the processing device.When the processing device receives a first image data corresponding tothe first voice data via the wireless transmission device, theprocessing device operates the projection device to project a firstprojection picture according to the first image data

The multimedia system of the invention includes a cloud system and amultimedia apparatus. The cloud system has a voice recognition deviceand an external server. The multimedia apparatus includes a wirelesstransmission device, an audio receiving device, a processing device anda projection device. The wireless transmission device is coupled withthe cloud system. The audio receiving device is for receiving a firstvoice data. The processing device is coupled with the audio receivingdevice and the wireless transmission device. The processing device isfor outputting the first voice data via the wireless transmission deviceto the cloud system. The voice recognition device is suitable forrecognizing the first voice data to obtain a recognition result, and theexternal server outputs a first image data according to the recognitionresult to the processing device. The projection device is coupled withthe processing device. When the processing device receives the firstimage data corresponding to the first voice data via the wirelesstransmission device, the processing device operates the projectiondevice to project a first projection picture according to the firstimage data.

Based on the above, the multimedia system including the cloud system andthe multimedia apparatus can provide interactive and diversifiedmultimedia operation functions and good user experiences based on thevoice data of the user.

The invention will become more fully understood from the detaileddescription and accompanying drawings, which are given for illustrationonly, and thus are not limitative of the present invention.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a schematic diagram showing the multimedia apparatus accordingto an embodiment of the invention.

FIG. 2 is a side view of the multimedia apparatus shown in FIG. 1.

FIG. 3 is a schematic diagram showing the multimedia system according toan embodiment of the invention.

FIG. 4 is a schematic diagram showing the emoticon graphics according toone embodiment of the invention.

FIG. 5 is a schematic diagram showing the multimedia system according toanother embodiment of the invention.

FIG. 6 is a schematic diagram showing the multimedia system according toanother embodiment of the invention.

FIG. 7 is a schematic diagram showing a multimedia system according toanother embodiment of the invention.

FIG. 8 is a schematic diagram showing a multimedia system according toanother embodiment of the invention.

DETAILED DESCRIPTION OF THE EMBODIMENT

The present invention will be apparent from the following detaileddescription, which proceeds with reference to the accompanying drawings,wherein the same references relate to the same elements.

FIG. 1 is a schematic diagram showing the multimedia apparatus accordingto an embodiment of the invention. Referring to FIG. 1, the multimediaapparatus 100 includes a processing device 110, a display module 120, aprojection device 130, a storage device 140, a wireless transmissiondevice 150, an audio receiving device 160, an audio playback device 170and a power supply device 180. In the present embodiment, the processingdevice 110 is coupled with the display module 120, the projection device130, the storage device 140, the wireless transmission device 150, theaudio receiving device 160, the audio playback device 170 and the powersupply device 180. In the present embodiment, the processing device 110includes connection ports 111 and 112 that complies with the SPI (SerialPeripheral Interface) and a connection port 113 that complies with theMIPI-DSI (Mobile Industry Processor Interface-Display Serial Interface)protocol. In the present embodiment, the processing device 110 iscoupled with the projection device 130 via the connection port 113.

In the present embodiment, the processing device 110 is a SoC (System onChip). However, the invention is not limited therein. In otherembodiment, the processing device 110 may also be a CPU (CentralProcessing Unit), a programmable general-purpose microprocessor, a DSP(digital signal processor), a programmable controller, an ASIC(Application-Specific Integrated Circuit), a PLD (Programmable LogicDevice), other similar processing device or a combination of thesedevices.

In the present embodiment, the display module 120 includes a firstdisplay 121 and a second display 122. The processing device 110 iscoupled with the first display 121 and the second display 122 of thedisplay module 120 via the connection ports 111 and 112, respectively.The first display 121 and the second display 122 may be, for example,PMOLED (Passive Matrix Organic Light Emitting Diode) displays, but theinvention is limited therein. In one embodiment, the display module 120may include one or more displays and is not limited by FIG. 1. Moreover,the displays of the display module 120 may be, for example, EPDs(electronic paper display), LCDs (liquid crystal display), LEDs(light-emitting diode), FEDs (field emission display) or other type ofdisplays.

In the present embodiment, the first display 121 and the second display122 are connected to ports 111 and 112 to receive the two sub-imagesignals provided by the processing device 110, respectively, and displaytwo sub-image pictures according to these two sub-image signals,respectively. In the present embodiment, the two sub-image picturesshown by the first display 121 and the second display 122 are suitablefor being combined into an emoticon graphic. For example, the firstdisplay 121 and the second display 122 may display a left-eye graphicand a right-eye graphic, respectively. The processing device 110 cancontrol the first display 121 displaying the sub-image picturerepresenting the left-eye graphic and the second display 122 displayingthe sub-image picture representing the right-eye graphic.

In the present embodiment, the storage device 140 may be, for example, aDRAM (dynamic random access memory), a flash memory or an NVRAM(non-volatile random access memory). In the present embodiment, thestorage device 140 stores the data and module described in theembodiments of the invention which are accessible and/or executable bythe processing device 110 so that the multimedia apparatus 100 canrealize the multimedia application operations described in theembodiments of the invention.

In the present embodiment, the wireless transmission device 150 may be,for example, a device compatible with Wi-Fi (wireless fidelity), WiMAX(worldwide interoperability for microwave access), Bluetooth, UWB(ultra-wideband) or RFID (radio-frequency identification) communicationprotocol. The wireless transmission device 150 establishes a wirelessconnection with another electronic device or cloud system having awireless transmission device via wireless transmissions, and cantransmit the voice data, image data and event-processing signalsaccording to the embodiments of the invention.

In the present embodiment, the power supply device 180 is for providingvoltage power to the processing device 110 to enable the processingdevice 110 to perform the multimedia operations according to theembodiments of the invention. Moreover, the power supply device 180 ofthe present embodiment may include a DC to DC converter to convert thevoltage level provided by an external power source or an internalbattery, but the invention is not limited therein.

FIG. 2 is a side view of the multimedia apparatus shown in FIG. 1.Referring to FIG. 1 and FIG. 2, the multimedia apparatus 100 may be, forexample, a smart speaker or a voice control speaker. One side of themultimedia apparatus 100 may have the display module 120, and anotherside may have the projection device 130. In the present embodiment, themultimedia apparatus 100 is disposed in the space formed by coordinateaxes P1, P2 and P3, wherein the coordinate axes P1, P2 and P3 areperpendicular to each other. The coordinate axes P1 and P2 may be, forexample, horizontal axes, and the coordinate axis P3 may be, forexample, a vertical axis. In the present embodiment, the multimediaapparatus 100 may be disposed on the surface S1 formed by the coordinateaxes P1 and P2, and the image picture can be projected by the multimediaapparatus 100 onto the surface S2 formed by coordinate axes P2 and P3.

In the present embodiment, the surface S1 may be a table surface, andthe surface S2 may be a wall surface. That is, a user can put themultimedia apparatus 100 on any table surface and interact with themultimedia apparatus 100 by voice. The user can see the emoticon graphicshown by the display module 120 at one side of the multimedia apparatus100, and see the image picture projected onto the surface S2 by theprojection device 130 at another side of the multimedia apparatus 100.However, the disposition, the projection direction and the appearance ofthe multimedia apparatus 100 is not limited by those shown in FIG. 2.The embodiment shown in FIG. 2 is just an exemplary example of onepossible implementation of the invention.

In the present embodiment, the audio receiving device is for receiving afirst voice data. The processing device outputs the first voice data viathe wireless transmission device. When the processing device receivesthe first image data corresponding to the first voice data via thewireless transmission device, the processing device operates theprojection device to project the first projection picture according tothe first image data. Several embodiments will be described hereinbelowas enabling examples of the invention.

FIG. 3 is a schematic diagram showing the multimedia system according toan embodiment of the invention. Referring to FIG. 1 and FIG. 3, themultimedia system 30 may include the multimedia apparatus 100 and acloud system 300. The cloud system 300 may be implemented in externalphysical servers and establish connections with the multimedia apparatus100 via wireless transmissions. In the present embodiment, the cloudsystem 300 includes a voice recognition device 310. The voicerecognition device 310 includes an NLU (natural language understanding)system 311 and a database 312. In one embodiment, the voice recognitiondevice 310 may be, for example, a skill kit utilizing Alexa VoiceService developed by Amazon, or a skill kit developed based on Amazon'sAlexa Voice Service. The invention is not limited therein.

In the present embodiment, the processing device 110 of the multimediaapparatus 100 may further include an AVS (Audio Video coding Standard)module 101, an event handler 102, a video/audio playback module 103, agraphic output module 104, an electronic device control module 105 and adisplay control module 106. The processing device 110 can execute thesemodules to implement the multimedia operations according to theembodiments of the invention. Specifically, in the present embodiment,when the multimedia apparatus 100 receives relevant image data or eventhandling signals from the outside via the wireless transmission device150, the event processing module 102 can drive at least one of thevideo/audio playback module 103, the graphic output module 104, theelectronic device control module 105 and the display control module 106to perform relevant multimedia operations.

It is noteworthy that the modules described in the embodiments of theinvention can be implemented by way of software or hardware circuits.For example. These modules can be stored in the storage device 140 forthe access and execution of the processing device 110. Alternatively,these modules can be implemented in the processing device 110 in theform of hardware circuits and/or chip modules, respectively.

In the present embodiment, the multimedia apparatus 100 receives thefirst voice data provided by the user via the audio receiving device160, encodes the first voice data VS1 via the AVS module 101 executed bythe processing device 110, and outputs via the wireless transmissiondevice 150. In the present embodiment, the processing device 110 of themultimedia apparatus 100 outputs the first voice data VS1 via thewireless transmission device 150 to the voice recognition device 310 ofthe cloud system 300. The voice recognition device 310 can recognize thefirst voice data VS1 via the NLU module 311, and obtain a second voicedata VS2 corresponding to the recognition result of the first voice databy searching the database 312. In the present embodiment, the cloudsystem 300 sends the second voice data VS2 to the multimedia apparatus100 via wireless transmission. Therefore, when the processing device 110receives the second voice data VS2 responding to the recognition resultof the first voice data VS1 via the wireless transmission device 150,the processing device 110 executes the AVS module 101 to operate theaudio playback device 170 to play the second voice data VS2.

That is, the multimedia system 30 has voice recognition functionality.The multimedia apparatus 100 can receive the first voice data VS1provided by the user via the audio receiving device 160, transmits tothe voice recognition device 310 of the cloud system 300 via wirelesstransmission, and performs the voice recognition function to the firstvoice data VS1 to obtain the second voice data VS2 corresponding to therecognition result of the first voice data VS1. Moreover, the multimediaapparatus 100 of the present embodiment can playback the second voicedata VS2 sent from the voice recognition device 310 via the audioplayback device 170. Therefore, the multimedia system 30 of the presentembodiment can provide multimedia operation functionalities such asvoice interactions, so that the user can provide voice messages to themultimedia apparatus 100 can receive corresponding voice feedbacks.Moreover, in the present embodiment, the processing device 130 of themultimedia apparatus 100 can drive the display module 120 via the imagesignal provided by the display control module 106 to display the imagepicture, wherein the image signal is determined in response to thesecond voice data.

In detail, FIG. 4 is a schematic diagram showing the emoticon graphicsaccording to one embodiment of the invention. Referring to FIG. 1, FIG.3 and FIG. 4, in the present embodiment, the multimedia apparatus 100generates the first sub-image signal and the second sub-image signal viathe display control module 16 in response to the second voice data VS2,and outputs the first sub-image signal and the second sub-image signalto the first display 121 and the second display 122 respectively,wherein the first display 121 displays the first sub-image pictureaccording to the first sub-image signal, and the second display 122displays the second sub-image picture according to the second sun-imagesignal. As shown in FIG. 4, the sub-image pictures 411 and 412 arecombined to be an emoticon graphic 410. The sub-image pictures 421 and422 are combined to be another emoticon graphic 420. The sub-imagepictures 431 and 432 are combined to be still another emoticon graphic430. The multimedia apparatus 100 can include image data of theseemoticon graphics 410, 420 and 430 shown in FIG. 4 by default. The twosub-image pictures displayed by the first display 121 and the seconddisplay 122 are suitable to be combined into one of the exemplaryemoticon graphics 410, 420 and 430. That is, the processing device 110can operate the first display 121 and the second display 122 to displayone of the emoticon graphics 410, 420 and 430 according to therecognition result (the second voice data VS2) of the first voice dataVS1.

For example, in one embodiment, the emoticon graphic 410 may be suitableto represent that the multimedia apparatus 100 is in a normal state or astandby state. The emoticon graphic 420 may be suitable for thesituation where the voice recognition device 310 failed to recognize thefirst voice data VS1. The emoticon graphic 430 may be suitable for thesituation where the multimedia apparatus 100 is in a sleep state. Thatis, the user can know the operation state of the multimedia system 30based on the emoticon graphic shown by the display module 120. However,the emoticon graphic of the invention is not limited by those shown inFIG. 4. In one embodiment, the emoticon graphics composed of the firstsub-image pictures and the second sub-image pictures of the displaymodule 120 can be selectively set by the user settings or the productrequirements.

FIG. 5 is a schematic diagram showing the multimedia system according toanother embodiment of the invention. Referring to FIG. 1 and FIG. 5, themultimedia system 50 may include the multimedia apparatus 100 and acloud system 500. Compared to the embodiment shown in FIG. 3, in thepresent embodiment, the cloud system 500 may include a voice recognitiondevice 510 and an external server 520. In the present embodiment, thevoice recognition device 510 may include an NLU module 511, a database512 and a TTS (Text-To-Speech) module 513. The external server 520 mayinclude a processing module 521 and a render module 522.

In the present embodiment, the multimedia apparatus 100 receives thefirst voice data provided by the user via the audio receiving device160, executes the AVS module 101 to process the first voice data VS1,and outputs via the wireless transmission device 150. The processingdevice 110 of the multimedia apparatus 100 outputs the first voice dataVS1 via the wireless transmission device 150 and transmits to the voicerecognition device 510 of the cloud system 500. The voice recognitiondevice 510 recognizes the first voice data VS1 using the NLU module 511and the database 512 and obtains the recognition result corresponding tothe first voice data VS1. In the present embodiment, if the recognitionresult of the first voice data VS1 provided by the user is, for example,a request of retrieving news information and the voice recognitiondevice 510 recognizes successfully, the voice recognition device 510 canfurther outputs the first instruction signal IS1 to the processingmodule 521 of the external server 520 according to the recognitionresult of the first voice data VS1, so that the external server 520 canretrieve the target data TD related to the news information from theInternet. In the present embodiment, the render server 522 of theexternal server 520 arranges the target data TD in a default dataarrangement format to generate the first image data AD1 and output tothe processing device 110. That is, the render module 522 of the presentembodiment can re-arrange the data format of the target data TD to makethe target data TD have a presentation format different from that of theoriginal data retrieved from the Internet.

In the present embodiment, the target data TD may be, for example,Internet search results such as specific news webpage information,weather information, or un-specific webpage information. The inventionis not limited therein. The event processing module 102 of theprocessing device 110 of the multimedia apparatus 100 can receive thefirst image data AD1 of the external server 520 via the wirelesstransmission device 150, and can drive at least one of the video/audioplayback module 103 or the graphic output module 104 to execute relatedmultimedia operations.

By driving at least one of the video/audio playback module 103 or thegraphic output module 104 by the event processing module 102 of theprocessing device 110, and by operating the projection device 130 toproject the first projection picture according to the first image dataAD1, the first projection picture corresponds to the first image dataAD1. That is, in the present embodiment, the user can send a voice datarequest to the multimedia apparatus 100. The multimedia apparatus 100can retrieve the target data TD of related news informationcorresponding to the voice data via the cloud system 500 and provide thetarget data TD to the user in the form of an image picture.

Moreover, it is noteworthy that the image data described in theembodiments may further include voice data, and the processing device110 can drive the video/audio playback module 103 to operate the audioplayback device 170 to playback the audio corresponding to the receivedvoice data according to the received voice data to accompany toprojection picture projected by the projection device 130.

More specifically, in the present embodiment, the voice recognitiondevice 510 can retrieve the target data TD from the Internet andtransform the target data TD into a third voice data VS3 by the TTSmodule 513 to output to the multimedia apparatus 100. Subsequently, themultimedia apparatus 100 receives the third voice data VS3 via thewireless transmission device 150, so that the processing device 110executes the AVS module 101 to process the third voice data VS3 tooperate the audio playback device 170 to playback the third voice dataVS3. That is, in the present embodiment, the user can send a voice datarequest to the multimedia apparatus 100, and the multimedia apparatus100 can retrieve the target data TD of news information corresponding tothe voice data from the cloud system 500 and provide the target data TDto the user in the form of a voice feedback.

Furthermore, the details of the devices and the modules of themultimedia apparatus and the external server of the present embodimentcan be found according to the teachings and explanations described inthe previous embodiments shown in FIG. 1 and FIG. 3, which are omittedhere for concise purpose.

FIG. 6 is a schematic diagram showing the multimedia system according toanother embodiment of the invention. Referring to FIG. 1 and FIG. 6, inthe present embodiment, the multimedia system 60 may include themultimedia apparatus 100 and a cloud system 600, wherein the cloudsystem 600 may include a voice recognition device 610 and an externalserver 620. In the present embodiment, the voice recognition device 610may include an NLU module 611 and a database 612. The external server620 may include a processing module 621.

In the present embodiment, the multimedia apparatus 100 receives thefirst voice data provided by the user via the audio receiving device 160and executes the AVS module 101 by the processing device 110 to processthe first voice data VS1 and outputs via the wireless transmissiondevice 150. The processing device 110 of the multimedia apparatus 100outputs the first voice data VS1 via the wireless transmission device150 to the voice recognition device 610 of the cloud system 600, so thatthe voice recognition device 610 recognizes the first voice data VS1 bythe NLU module 611 and the database 612 and obtains the recognitionresult corresponding to the first voice data VS1. In the presentembodiment, if the recognition result corresponding to the first voicedata VS1 provided by the user is a request of showing a second imagedata AD2 or the graphic data ID stored in the storage device 140 of themultimedia apparatus 100 in advance, that is, the recognition result ofthe first voice data VS1 is related to the second image data AD2 and thevoice recognition device 610 recognizes it successfully, the voicerecognition device 610 may further output a second instruction signalIS2 to the processing module 621 according to the recognition result ofthe first voice data VS1. The event processing module 102 of theprocessing device 110 of the multimedia apparatus 100 receives a firstevent-processing signal ES1 output by the processing module 621corresponding to the second instruction signal IS2 via the wirelesstransmission device 150.

In the present embodiment, the event processing module 102 of theprocessing device 110 may read the second image data AD2 or the graphicdata ID pre-stored in the storage device 140 based on the firstevent-processing signal ES1, and provide the second image data AD2 orthe graphic data ID to the video/audio playback module 103 or thegraphic output module. Therefore, the video/audio playback module 103 orthe graphic output module 104 of the processing device 110 may operateprojection device 130 to project the second projection picture based onthe second image data AD2 or the graphic data ID. In the presentembodiment, the second image data AD2 may be, for example, image data inthe MPEG-4 format having a resolution of 480p, and the graphic data IDmay be, for example graphic data of the BMP or JPG format having aresolution of 1280×720. That is, the multimedia system 60 of the presentembodiment can provide multimedia operating functions by voice control.The user can make a voice data request related to play the image data orthe graphic data pre-stored on the storage device 140, and send therequest to the multimedia apparatus 100. The multimedia apparatus 100recognizes the voice message of the user via the cloud system 600, andplays the second image data VD2 or the graphic data ID pre-stored in thestorage device 140 based on the recognition result corresponding to thevoice data.

The details of the devices and the modules of the multimedia apparatusand the external server of the present embodiment can be found accordingto the teachings and explanations described in the previous embodimentsshown in FIG. 1 and FIG. 3, which are omitted here for concise purpose.

FIG. 7 is a schematic diagram showing a multimedia system according toanother embodiment of the invention. Referring to FIG. 1 and FIG. 7, amultimedia system 70 may include the multimedia apparatus 100 and acloud system 700. In the present embodiment, the cloud system 700 mayinclude a voice recognition device 710 and an external server 720. Thevoice recognition device 710 may include an NLU module 711 and adatabase 712. In the present embodiment, the external server 720 mayinclude a processing module 721. The processing device 110 of themultimedia apparatus 100 is coupled with an external electronic device800 by wireless transmissions via the wireless transmission device 150,and controls the external electronic device 800 by outputting a controlsignal CS via driving the electronic device control module 105 of theprocessing device 110 according to the recognition result of the firstvoice data VS1.

In the present embodiment, the electronic device 800 may be, forexample, a lighting device, and the electronic device 800 may include awireless transmission device 805 and an electronic device control module806. The electronic device 800 can establish a connection with thewireless transmission device 150 of the multimedia apparatus 100 via thewireless transmission device 805. Moreover, the electronic devicecontrol module 806 of the electronic device 800 can receive the controlsignal CS provided by the electronic device control module 105 of theprocessing device 110 via the wireless transmission device 805.

In the present embodiment, the multimedia apparatus 100 receives thefirst voice data VS1 provided by the user by the audio receiving device160, and executes the AVS module 101 by the processing device 110 toprocess the first voice data VS1 and outputs via the wirelesstransmission device 150. The processing device 110 of the multimediaapparatus 100 outputs the first voice data VS1 via the wirelesstransmission device 150 to the voice recognition device 710 of the cloudsystem 700, so that the voice recognition device 710 recognizes thefirst voice data VS1 by the NLU module 711 and the database 712 andobtains the recognition result corresponding to the first voice dataVS1. In the present embodiment, if the recognition result correspondingto the first voice data VS1 provided by the user is a request ofadjusting the brightness of the lighting device, and the voicerecognition device 710 recognizes it successfully, the voice recognitiondevice 710 may further output a third instruction signal IS3 to theprocessing module 721 of the external server 720 according to therecognition result of the first voice data VS1. The event processingmodule 102 of the processing device 110 of the multimedia apparatus 100receives a second event-processing signal ES2 output by the processingmodule 721 corresponding to the third instruction signal IS3 via thewireless transmission device 150.

In the present embodiment, the event processing module 102 may drive theelectronic device control module 105 based on the secondevent-processing signal ES2, so that the electronic device controlmodule 105 generates the control signal CS corresponding to the secondevent-processing signal ES2. Moreover, the processing device 110 outputsthe control signal CS via the wireless transmission device 150 to thewireless transmission device 805 of the electronic device 800, so thatthe electronic device control module 806 of the electronic device 800can adjust the brightness of lighting device according to the controlsignal CS. That is, in the present embodiment, the user can make a voicedata request of adjusting the brightness of the lighting device andtransmit to the multimedia apparatus 100. The multimedia apparatus 100recognizers the voice data via the cloud system 700 and obtains therecognition result. Therefore, the multimedia apparatus 100 can adjustthe brightness of the lighting device based on the recognition result ofthis voice message.

However, the electronic device 800 of the invention is not limited to alighting device. In one embodiment, the electronic device 800 may alsobe another electronic device supporting IoT (Internet of Things)technologies, and may allow a user to control other electronic devicesvia the multimedia system 70 by voice. Furthermore, the details of thedevices and the modules of the multimedia apparatus and the externalserver of the present embodiment can be found according to the teachingsand explanations described in the previous embodiments shown in FIG. 1and FIG. 3, which are omitted here for concise purpose.

FIG. 8 is a schematic diagram showing a multimedia system according toanother embodiment of the invention. Referring to FIG. 1 and FIG. 8, amultimedia system 80 may include a multimedia apparatus 100 and anaccess management device 900. In the present embodiment, the accessmanagement device 900 may be, for example, an electronic device havingaccess management functions or a mobile device, a tablet, a portablepersonal computer, or an electronic device having video/audio functionsthat is connected with an access management system. The invention is notlimited therein. The multimedia apparatus 100 can establish a connectionwith the access management device 900 via the wireless transmissiondevice 150, and establish a further connection with the accessmanagement system via the application of the access management device900. Or, in one embodiment, the multimedia apparatus 100 can alsoestablish a connection with the electronic device having accessmanagement functions via the wireless transmission device 150 directly.The invention is not limited therein. In the present embodiment, theaccess management device 900 includes an access management module 901, anotification module 902 and a streaming module 903.

In the present embodiment, when a visitor operates the access managementdevice 900 (such as ringing a doorbell), the notification module 902 isfor providing a third event-processing signal ES3 to the processingdevice 110, so that the event processing module 102 of the processingdevice 110 can drive at least one of the video/audio playback module 103or the graphic output module 104 correspondingly. The event processingmodule 102 receives a third image data AD3 provided by the streamingmodule 903 of the access management device 900 via the wirelesstransmission device 150, and drives the video/audio playback module 103to operate the projection device 130 to project a third projectionpicture according to the third image data AD3. The third image data AD3may be the image of the visitor outside the door.

In other words, the multimedia system 80 of the present embodiment canprovide access management functions. The access management module 901 ofthe access management device 900 receives the image data outside thedoor, and drives the notification module 902 and the streaming module903 to generate the third event-processing signal ES3 and the thirdimage data AD3. In the present embodiment, the third image data AD3 maybe, for example, MPEG-4 image data having a resolution of 480p.

Moreover, the details of the devices and the modules of the multimediaapparatus and the external server of the present embodiment can be foundaccording to the teachings and explanations described in the previousembodiments shown in FIG. 1 and FIG. 3, which are omitted here forconcise purpose.

To sum up, the multimedia apparatus and the multimedia system of theinvention may have voice recognition and projection functions. After theuser provided voice data, the multimedia apparatus can executecorresponding multimedia operations and generate corresponding voicedata and projection image as a feedback. Moreover, the multimediaapparatus of the invention can display the emoticon graphic by thedisplay module as a response to the recognition result of the voice datato provide a personification responding effect. Furthermore, themultimedia apparatus of the invention can further be wirelessly coupledwith an external electronic device, such as a lighting device or anaccess management system, so that the user can remotely control thelighting device via voice data, or view the image outside the doorprovided by the access management system via the projection device.Therefore, the multimedia apparatus and the multimedia system of theinvention have interactive and diversified multimedia operationfunctions to provide good user experience.

Although the invention has been described with reference to specificembodiments, this description is not meant to be construed in a limitingsense. Various modifications of the disclosed embodiments, as well asalternative embodiments, will be apparent to persons skilled in the art.It is, therefore, contemplated that the appended claims will cover allmodifications that fall within the true scope of the invention.

What is claimed is:
 1. A multimedia apparatus, comprising: a wirelesstransmission device; an audio receiving device for receiving a firstvoice data; a processing device coupled with the audio receiving deviceand the wireless transmission device, the processing device being foroutputting the first voice data via the wireless transmission device,wherein the processing device is coupled with a voice recognition devicevia the wireless transmission device, the voice recognition device issuitable of recognizing the first voice data to obtain a recognitionresult; a projection device coupled with the processing device, when theprocessing device receiving a first image data corresponding to thefirst voice data via the wireless transmission device, the processingdevice operating the projection device to project a first projectionpicture according to the first image data; and a storage device coupledwith the processing device for storing a second image data, wherein therecognition result of the first voice data is related to the secondimage data, the voice recognition device outputs a second instructionsignal to an external server according to the recognition result,whereby the processing device receives, via the wireless transmissiondevice, a first event-processing signal outputted by the external servercorresponding to the second instruction signal, wherein the processingdevice retrieves the second image data from the storage device based onthe first event-processing signal and operates the projection device toproject a second projection picture according to the second image data.2. The multimedia apparatus according to claim 1, further comprising: anaudio playback device coupled with the processing device, wherein whenthe processing device receives a second voice data responding to therecognition result of the first voice data via the wireless transmissiondevice, the processing device operates the audio playback device to playthe second voice data.
 3. The multimedia apparatus according to claim 2,further comprising: a display module coupled with the processing device,the processing device further comprising a display control module, thedisplay module being for displaying an image picture according to animage signal provided by the display control module, wherein the imagesignal is determined responding to the second voice data.
 4. Themultimedia apparatus according to claim 3, wherein the display modulecomprises a first display and a second display, and the image signalcomprises a first sub-image signal and a second sub-image signal,wherein the first display displays a first sub-image picture accordingto the first sub-image signal, and the second display displays a secondsub-image picture according to the second sub-image signal, wherein thefirst sub-image picture and the second sub-image picture are suitable ofbeing combined into an emoticon graphic.
 5. The multimedia apparatusaccording to claim 4, wherein the first display and the second displayare passive matrix organic light emitting diode displays.
 6. Themultimedia apparatus according to claim 4, wherein the processing deviceis coupled with the first display and the second display via twoconnection ports conforming to the Serial Peripheral Interface protocol,respectively.
 7. The multimedia apparatus according to claim 1, whereinthe processing device is coupled with the projection device via aconnection port conforming to the Mobile Industry ProcessorInterface—Display Serial Interface protocol.
 8. The multimedia apparatusaccording to claim 1, wherein the voice recognition device outputs afirst instruction signal to an external server according to therecognition result of the first voice data, the external serverretrieves target data according to the first instruction signal from theInternet, and the external server arranges the target data in a defaultdata arrangement format to generate the first image data and output tothe processing device, the processing device receives the first imagedata of the external server via the wireless transmission device.
 9. Themultimedia apparatus according to claim 1, further comprising: anelectronic device coupled with the processing device via the wirelesstransmission device, wherein the processing device further comprises anelectronic device control module, the electronic device control moduleoutputs a control signal to the electronic device according to therecognition result of the first voice data.
 10. The multimedia apparatusaccording to claim 9, wherein the electronic device is a lightingdevice, and the voice recognition device outputs a third instructionsignal to an external server according to the recognition result of thefirst voice data, whereby the processing device receives via thewireless transmission device a second event-processing signal output bythe external server corresponding to the third instruction signal,wherein the electronic device control module of the processing devicegenerates the control signal according to the second event-processingsignal, whereby the electronic device control module operates thelighting device via the control signal.
 11. The multimedia apparatusaccording to claim 1, further comprising: an access management devicecoupled with the processing device via the wireless transmission device,the processing device further comprising an event processing module, andwhen the event processing module receiving via the wireless transmissiondevice a third image data provided by the access management device, theevent processing module of the processing device operating theprojection device to project a third projection picture according to thethird image data.
 12. A multimedia system, comprising: a cloud systemhaving a voice recognition device and an external server; and amultimedia apparatus, comprising: a wireless transmission device coupledwith the cloud system; an audio receiving device for receiving a firstvoice data; a processing device coupled with the audio receiving deviceand the wireless transmission device, the processing device being foroutputting the first voice data via the wireless transmission device tothe cloud system, the voice recognition device recognizing the firstvoice data to obtain a recognition result, and the external serveroutputting a first image data according to the recognition result of thefirst voice data to the processing device, wherein the processing deviceis coupled with the voice recognition device via the wirelesstransmission device; a projection device coupled with the processingdevice, when the processing device receiving the first image datacorresponding to the first voice data via the wireless transmissiondevice, the processing device operating the projection device to projecta first projection picture according to the first image data; and astorage device coupled with the processing device for storing a secondimage data, wherein the recognition result of the first voice data isrelated to the second image data, the voice recognition device outputs asecond instruction signal to an external server according to therecognition result, whereby the processing device receives, via thewireless transmission device, a first event-processing signal outputtedby the external server corresponding to the second instruction signal,wherein the processing device retrieves the second image data from thestorage device based on the first event-processing signal and operatesthe projection device to project a second projection picture accordingto the second image data.
 13. The multimedia system according to claim12, wherein the multimedia apparatus further comprises: an audioplayback device coupled with the processing device, wherein the voicerecognition device obtains a second voice data according to therecognition result of the first voice data, wherein when the processingdevice receives the second voice data via the wireless transmissiondevice, the processing device operates the audio playback device to playthe second voice data.
 14. The multimedia system according to claim 13,wherein the multimedia apparatus further comprises: a display modulecoupled with the processing device, the processing device furthercomprising a display control module, the display module being fordisplaying an image picture according to an image signal provided by thedisplay control module, wherein the image signal is determinedresponding to the second voice data.
 15. The multimedia system accordingto claim 14, wherein the display module comprises a first display and asecond display, and the image signal comprises a first sub-image signaland a second sub-image signal, wherein the first display displays afirst sub-image picture according to the first sub-image signal, and thesecond display displays a second sub-image picture according to thesecond sub-image signal, wherein the first sub-image picture and thesecond sub-image picture are suitable of being combined into an emoticongraphic.
 16. The multimedia system according to claim 15, wherein thefirst display and the second display are passive matrix organic lightemitting diode displays.
 17. The multimedia system according to claim15, wherein the processing device is coupled with the first display andthe second display via two connection ports conforming to the SerialPeripheral Interface protocol, respectively.
 18. The multimedia systemaccording to claim 12, wherein the processing device is coupled with theprojection device via a connection port conforming to the MobileIndustry Processor Interface—Display Serial Interface protocol.
 19. Themultimedia system according to claim 12, wherein the voice recognitiondevice outputs a first instruction signal according to the recognitionresult of the first voice data to the external server, the externalserver retrieves target data according to the first instruction signalfrom the Internet, and the external server arranges the target data in adefault data arrangement format to generate the first image data andoutput to the processing device, the processing device receives thefirst image data of the external server via the wireless transmissiondevice.
 20. The multimedia system according to claim 12, furthercomprising: an electronic device coupled with the processing device viathe wireless transmission device, wherein the processing device furthercomprises an electronic device control module, the electronic devicecontrol module outputs a control signal to the electronic deviceaccording to the recognition result of the first voice data.
 21. Themultimedia system according to claim 20, wherein the electronic deviceis a lighting device, and the voice recognition device outputs a thirdinstruction signal to an external server according to the recognitionresult of the first voice data, whereby the processing device receivesvia the wireless transmission device a second event-processing signaloutput by the external server corresponding to the third instructionsignal, wherein the electronic device control module of the processingdevice generates the control signal according to the secondevent-processing signal, whereby the electronic device control moduleoperates the lighting device via the control signal.
 22. The multimediasystem according to claim 12, further comprising: an access managementdevice coupled with the processing device via the wireless transmissiondevice, the processing device further comprising an event processingmodule, and when the event processing module receiving via the wirelesstransmission device a third image data provided by the access managementdevice, the event processing module of the processing device operatingthe projection device to project a third projection picture according tothe third image data.